Model pretrained on C4 using T5's unsupervised objective for ~500k steps, model
size is comparable to T5's base ~770m parameters.
